Putting language into language modeling

Authors

  • Frederick Jelinek
  • Ciprian Chelba
Abstract

In this paper we describe the statistical Structured Language Model (SLM) that uses grammatical analysis of the hypothesized sentence segment (prefix) to predict the next word. We first describe the operation of a basic, completely lexicalized SLM that builds up partial parses as it proceeds left to right. We then develop a chart parsing algorithm and with its help a method to compute the prediction probabilities $P(w_{i+1} \mid W_i)$. We suggest useful computational shortcuts followed by a method of training SLM parameters from text data. Finally, we introduce more detailed parametrization that involves non-terminal labeling and considerably improves smoothing of SLM statistical parameters. We conclude by presenting certain recognition and perplexity results achieved on standard corpora.


Similar resources

From PUF to UML

The Unified Modeling Language (UML) is widely used by Software Engineers as the basis of analysis and design in software development. While UML is very strong at specifying the structure and functionality of the application, it is seldom used to its potential to specify usability-related information. The Putting Usability First (PUF) methodology of Usability Engineering identifies and specifi...


Frame-Level Selective Decoding Using Native and Non-native Acoustic Models for Robust Speech Recognition to Native and Non-native Speech

Regarded as a mismatch problem between the training and test conditions: the training condition is native speech, the testing condition is non-native speech, and methods from speaker or environment adaptation are widely used. Research works dedicated to non-native ASR cover acoustic modeling, pronunciation modeling, language modeling, and hybrid modeling; many studies use a small amount of non-native speech Wh...


Modeling and Non-modeling Genre-based Approach to Writing Argument-led Introduction Paragraphs: A Case of English Students in Iran

Despite the crucial role of introductory sections in argumentative academic writing, the effects of genre-based approaches to writing introductory paragraphs have not been much explored yet. The present study aimed to investigate whether the provision of genre knowledge through modeling and non-modeling could enhance learners’ ability in writing introductory paragraphs of argumentative essays....


Language Proficiency and Identity: Developing a Structural Equation Modeling (SEM) of Identity for Iranian EFL Learners

This study was an endeavor to develop a model of identity among Iranian EFL learners. To achieve this end, a multiphase design was implemented. Initially, it attempted to investigate different factors of identity to propose and validate a model. Thus, 120 EFL learners studying in different English language institutes in Iran were randomly selected, and 36 learners were interviewed about their v...


The Role of Mediation of English Learning Anxiety in the Relationship between Motivational Language Selves Systems and Language Performance

Introduction: Language learning is the product of the complex interaction of internal factors of thinking and cognition, and external factors of emotions and social and cultural interactions. Second language learning anxiety as one of the types of educational anxiety can affect learners' performance. So, the aim of this study was the modeling of English language motivational selves based on lan...




Publication date: 1999